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Abstract 

We introduce a variant of the cover time of a graph, called cover cost, 
in which the cost of a step is proportional to the number of yet uncovered 
vertices. It turns out that cover cost is more tractable than cover time; we 
provide an 0(n 4 ) algorithm for its computation, as well as some explicit 
formulae. The two values are not very far from each other, and so cover 
cost might be a useful tool in the study of cover time. 

1 Introduction 

The cover time CT r (G) of a graph G from a vertex r is the expected num- 
ber of steps it takes for random walk on G starting at r to visit all vertices. 
The cover time of G is defined as CT(G) := max r(E y( G ) CT r (G). It has been 
extensively studied in various contexts; applications include the construction 
of universal traversal sequences }AKL + 79l IBro90] . testing graph connectivity 
AKL+791 IKR95J . and protocol testing |MP94j . It has been studied by physi- 



cists interested in the fractal structure of the uncovered set of a finite grid; see 
jDPRZOlj for references and for an interesting relation between the cover time 
of a finite grid and Brownian motion on Ricmannian manifolds. 

Many bounds on cover time for specific classes of graphs have been obtained, 
see e.g. [BW90I IFei951 IGWbj and references therein. However, in general it is 
very hard to obtain exact formulae for cover time, or compute it algorithmically. 
A question of |AF| that remained open for many years despite several efforts 
|Mat88l IKKLV00] . and was recently resolved in the affirmative using heavy 
probabilistic machinery |DLPj . is whether there is a deterministic algorithm 
which approximates CT(G) up to a constant factor in polynomial time. 

The following situation suggests a similar concept, which we will call the 
cover cost of G. Suppose that a truck starts at r loaded with some goods to 
be equally distributed to the other vertices of the graph and performs random 
walk, leaving the goods corresponding to each vertex the first time it gets there 
and carrying all remaining goods along. Rather than the expected number of 
steps it will take to finish this tour (which equals CT r (G)), the truck driver 
is more interested in the expected total amount of goods that will have to be 
carried. 
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Definition 1.1. Consider a cover tour of random walk on a graph G = (V, E) 
starting at vertex r. Define the cost of step i to be 1 — ^ where k is the (random) 
number of vertices visited so far excluding r, and n := \V\ — 1. Define the cover 
cost cc r (G) to be the expected total cost of all steps in the cover tour. 

As an example, let S n be a star with center r and n leaves. The problem 
of determining CT r {S n ) is the well known coupon collection problem, and one 
has CT r (S n ) « 2nlnn [AFj . It is easy to check that cc r (S n ) ~ 2n: having 
visited the first k leaves, it will take an expected steps to reach the next 
unvisited vertex, and almost each such step will cost the step from the 

newly reached leaf back to r will be cheaper by 1/n, and so cc r (S n ) =2n — 1. 

For reasons that will become clear soon, we also define the (uppercase) Cover 
Cost by CC r {G) := ncc r (G) for n = \V\ - 1. Note that cc r (G) < CT r (G) < 
CC r {G). There is a further intuitive way of defining Cover Cost. Suppose now 
that instead of a truck driver in the above game we have an electrician trying 
to visit all vertices of a graph to do some repair. Define his cost ECC r (G) to be 
the sum of the expected waiting times of his clientfl An easy double-counting 
argument implies that 

ECC r {G) = CG r {G). 

Note that, by linearity of expectation, ECC r (G) = X^ev H r {x), where H r (x) 
is the hitting time from r to x, i.e. the expected time for random walk starting 
at r to reach x. Thus 

CC r {G) = H r{x). (1) 

This formula yields a (deterministic) algorithm with running time 0(n 4 ) com- 
puting cover cost precisely. We describe this algorithm in Section [H 

In Section [3] we prove 
Theorem 1.2. For every graph G = (V,E) and every r € V, we have 

CC r {G) = J2 P - J ^ A 

t . Pxy 

where p xy is the probability that random walk from x will visit y before returning 
to x, and p r (x < y) is the probability that random walk from r will visit x before 
returning to y. 

In the case where G is a tree, Theorem 11.21 yields a simpler expression for 
cover cost involving no probabilistic parameters (see Corollary 13- X [) . and it im- 
plies a surprising connection to the Wiener index proved in |GWaj : see Section[3l 

In the case where G is the path P n of n edges we show that cover cost has 
the same order of magnitude 0(n 2 ) as cover time. However, the two parameters 
behave differently when n is fixed and we vary the starting point of our cover 
tour on P n . Both cc r (P n ) and CT r (P n ) are minimised when r is an endpoint 
and maximised when r is a midpoint, but the difference between the two ex- 
tremes behaves very differently: for CT r (P n ) this difference is also 0(n 2 ), about 
CT r (P n )/4, while for cc r (P n ) the difference is 0(n). 

1 This parameter was proposed by Peter Winkler (private communication). 
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Theorem 1.3. For a path P n with an even number of edges n, cover cost 
cc r {P n ) is minimised when r is an endpoint and maximised when r is the mid- 
point. Moreover, cc« (P n ) — cco(P n ) = n/4. 

We prove this in Section[2] in fact, we will give an exact formula for cc r (P n ) 
for every starting point r. 

In Section[5]we propose some problems on cover cost and its relation to cover 
time. 

Throughout this paper, a random walk begins at some vertex r of a finite 
graph G, and when at vertex x, it chooses one of the neighbours of x at random 
according to the uniform distribution, and moves to that neighbour. 

2 Paths 

In this section we obtain an explicit formula for the cover cost of a path, from 
which Theorem 11.31 immediately follows: 

Proposition 2.1. Let P n be the path on n + 1 vertices, indexed by 0, 1, . . . n. 

Then cc r {P n ) = + J^l. 

Proof. We are going to consider the sum of the hitting times H r (k) from r 
and apply |T]). For r = we have the well known formula Ho(k) = k 2 . For 
k > r > 0, we have Ho(k) = Ho(r) + H r (k). Combining these two formulas we 
get H r (k) = k 2 —r 2 for k > r, and similarly we get H r (k) = (n — k) 2 — (n — r) 2 = 
(r - k)(2n - (k + r)) for k < r. 
Thus we have 
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and using the formulae for the sum of the first r squares and the first r natural 
numbers we can rewrite this as 

J^H r (k)= - r 2 -nr 2 + 2n^±^- 

k 

= r(n-r). 
Plugging this into ([T]) we obtain the desired formula. □ 

3 General Formulae for Cover Cost 

Let G = (V, E) be a graph on n + 1 vertices, and fix r G V. We can express CC r 
as the sum of the contribution of each x G V to CC r as follows. Start a random 



3 



walk particle of 'charge' 1 at r, and each time the particle visits a vertex in 
y\{r} for the first time, reduce its charge by letting it continue its random 
walk with the remaining charge as long as this charge is non-zero. Note that 
the particle is stopped upon completing a cover tour. For every x € V, let D(x) 
denote the expected total amount of charge departing from x in this random 
walk. By the definitions, we have 

9^1 = CCr(G) = y D M (2 ) 

n 

xev 

Next, we are going to split each D(x) into contributions of each y 6 V as follows. 
Let V% v denote the expected number of times that random walk from x will visit 
x before visiting y for the first time; we also count the starting step as a visit 
to x, and so Vj v > 1 for every y/i. Let p r (x < y) denote the probability for 
random walk from r to visit x before y. 
We claim that 

D(x) = - ]T Pr (x<y)V^. (3) 

yeV\{x} 

To see this, think of the initial charge of the particle as having been divided 
into n 'quarks' of charge 1/n before beginning the tour, each quark labelled 
by a distinct vertex at which it is meant to be left. This allows as to write 
D(x) as the sum of the expected contributions of each quark to D. Linearity of 
expectation now implies the above formula. 

Using the formula for the expected number of repetitions of a Bernoulli trial 
until the first success, we see that V^ v = l/p xy where p xy is the probability that 
random walk from x will visit y before returning to x. Combining this to the 
above formulas we get 

CCr (G) = J2 Prix^y) (4) 

t . Pxy 

This proves Theorem 11.21 

In the case where G is a tree, the probabilistic parameters in (j4|) can be 
replaced by explicit graph-theoretic ones, yielding a pleasant formula for the 
cover cost. Given vertices r, x, y on a tree, let x A r y denote the confluent of x, y 
with respect to r, i.e. the vertex of minimal distance from r separating x from 
y. Recall that d(x) is the degree of x. We have 



Corollary 3.1 QGWaj ). Let T be a tree and r £ V(T). Th 



en 



CC r (T)= J2 d(xA r y,y)d(x). 

x,yeV(T) 

This is proved in |CWaj . where it is further used to obtain an interesting 
connection between CC r (T) and the Wiener index W(T) of T: it is proved that, 
for every tree T, 

J2 (H rv + d(r,v)) = CC r (T)+ d(r,v) = 2W(T):= ]T d(x,y). 

v<£V(T) vGV(T) x,y£V(T) 
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4 Algorithms 

Formula ([1} allows for an efficient computation of the exact value of cover time: 
fixing x G V , we can write 



for every y x E V , since the first step of random walk from y takes us to 
some neighbour of y. This, and the fact that H x {x) = 0, yields a system of n 
linear equations with n unknowns, which can be solved in time 0(n 3 ), e.g. by 
Gaussian elimination. In order to obtain CC r (G) it suffices, by ((T|), to solve n 
such systems, one for each x ^ r, and add up the values H r {x). Thus CC r (G) 
can be computed in time 0(n 4 ); in fact, with almost no additional effort we 
compute simultaneously the Cover Cost for every starting vertex r0 

A further algorithm for the computation of CC r (G) can be derived from 
the proof of Theorem 11.21 as follows. For the quantity D(r) of ([2]) we have, by 
^ and the discussion following it, D(r) = ^2 yeV \jp ry . Now each value p ry 
can be computed by solving a linear system of n equations similar to (|5|): we 
have p ry = J2{ z \rzeE}Pz(y < r) and p z (y < r) = J2{ z >\ Z z< eE}Vz'{y < r) for 
every z ^ r,y, while p y (y < r) = 1. Once D{r) has been computed, all other 
values D(x) 7 x E V\{r} can be obtained simultaneously by solving a single linear 
system of n equations: by the definition of D(x) we have 



Thus this method yields a further 0(n 4 ) algorithm for the computation of 
CC r {G). 

5 Further Problems 

The examples in the introduction show that cover cost might, depending on 
the graph, have the same order of magnitude as cover time or be quite smaller. 
How does cover cost behave in the extremal cases of 'fast graphs', i.e. when 
CT = (9(nln?i), and 'slow graphs', i.e. when CT(G) = 0(n 3 )? (It is known 
that nlnn < CT(G) < 4n 3 /27 for every graph G on n vertices jFei95j ). 

Problem 5.1. Is it true that for every M G R + there is c(M) such that for 
every graph G and r G V(G), ifCT r (G) < Mnhin then cc r (G) < c(M)n, where 
n=\V(G)\-lE 

Problem 5.2. Is it true that for every M G R + there is c(M) such that for 
every graph G and r G V(G), if CT r (G) > Mn 3 then cc r {G) > c(M)n 3 ? 

The extremal graphs for cover time are not known, although a lot of work 
has been done and graphs that are close to being extremal are known |Fei97j . 
It would be interesting to find the extremal graphs for cover cost: 

2 1 was made aware of this algorithm by Erol Pekoz. 

3 This problem arrised after a discussion with Itai Bcnjamini. 



H V {X) = 1 + 



E{ £ | 1£ 6E)_g^) 



(5) 




5 



Problem 5.3. Which rooted graph on n vertices minimises cc r (G)? Which 
maximises it? 



It would be very interesting to obtain bounds on cc r (G)/CT r (G), for this 
would allow us to use cover cost in order to obtain bounds on cover time. The 
following two problems are motivated by this. 

Conjecture 5.4. The path on n vertices rooted at an endpoint maximises 
cc r [G) / CT r (G) over all rooted graphs G on n vertices. 

Problem 5.5. Is there a graph G for which cc r (G)/CT r (G) < 1/ H(\y(G)\-i) 
where H n In n is the nth harmonic number? Which graph on n vertices 
minimises cc r (G)/CT r (G)? 
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